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Abstract 

Dependence coefficients have been widely studied for Markov processes defined by a set of 
transition probabilities and an initial distribution. In this paper, we provide new tools to check 
the convergence rates of copula-based Markov chains. We also improve some known results of 
Beare and provide a necessary condition for Markov chains based on Archimedian copulas to be 
exponential /O-mixing. Two general necessary condition on copulas to generate exponential rho 
mixing are given. 
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1 Introduction 

This work is motivated by a paper by Xiaohong Chen, Wei Biao Wu and Yanping Yi and a result 
of B.K. Beare. In the paper |12| . it was shown that Markov processes generated by the Clayton, 
Gumbel or Student copulas are geometrically ergodic. They used in their paper quantile transfor- 
mations and small sets to show geometric ergodicity, but could not handle for instance the mixture 
of these copulas. In the paper [7j, we have shown that these examples are actually exponential 
rho mixing and also answer the open question on geometric ergodicity of convex combinations of 
geometrically ergodic Markov chains. But we failed to relax the conditions of Beare's theorem to 
obtain exponential p-mixing rate without ^-mixing rate. This task is one of the major concerns of 
this paper. 

Quantifying the dependence among two or more random variables has been an enduring task 
for statisticians. A rich set of dependence measures has been proposed, including the well-known 
Pearson's correlation coefficients, Kendall's r and Spearman's p for bivariate random variables. 
While these measures are simple and can be easily computed, they are designed to capture only 
certain aspects of dependence. Indeed, it is rather unreasonable to expect a single scalar measure to 
have the capability to quantify all the dependence existing among the random variables. Copulas 
are full measures of dependence among components of random vectors. Unlike marginal and joint 
distributions, which are directly observable, a copula is a hidden dependence structure that couples 
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a joint distribution with its marginals. An early statistical application of copulas can be found in 
[3J , where the dependence between two survival times in a multiple events study is modeled by the 
so-called Clayton copula 

C(x,y) = (x-" + y-"-l)-i/" a>0. 

The literature on copulas is growing fast; an excellent overview, guide to the literature and appli- 
cations can be found in [6]. In later research into copulas, a driving force has been in financial risk 
management for modeling dependence among different assets in a portfolio. f8] can be regarded 
as one of the best books for an introduction to copulas. We also define a new list of copula with 
functional parameters, that can be used in various model. 

1.1 Definitions 

1.1.1 2-Copulas 

A 2-copula is a function C : [0, 1] x [0, 1] — t- [0, 1] = / with the following properties: 

1. C{0,y) = C{x,0) = - meaning that C is grounded; 

2. C{l,x) = C{x, 1) = X - meaning that each coordinate is uniform on /; 

3. for aU [xi, X2; yi, 2/2] C I^. 

C{xi,yi) + C{x2, 2/2) - C(xi, y2) - C{x2, yi) > 0. 

The definition doesn't mention probability, but in fact these conditions imply that C is the joint 
cumulative distribution function of two random variables with margins uniform on /. 

It follows from the definition (see [8| chapter 1]) that the function C is non-decreasing in each 
of the parameters, has partial derivatives a.e with values between and 1. The partial derivative 
of C are non-decreasing in the other parameter. A convex combination of 2-copulas is a 2-copula. 
An m-copula is defined for m > 2 by induction: 

1. C{xi, ■ ■ ■ ,Xi_i,0,Xi+i,- ■ ■ ,Xm) = Vi; 

2. C{xi, • • • , Xi-i, 1, Xj+i, • • • , Xm)- is a (m-l)-copula Vi; 

3. C{V)a(y) > For all rectangle in I™", where the sum is taken over all vertices of the 
rectangle, and (7(V) = ( — 1)", n- number of initial components of the basic intervals of the 
rectangle present at V. 

1.1.2 Sklar's Theorem 

If Xi,--- ,Xm are random variables with joint distribution F and marginal distributions 
Fi, - ■ ■ , Ffn , then the function 

C{Fi{xi),--- ,F>m{Xm)) = F{xi,--- ,Xm) 

is a copula with the properties given above. Moreover, if the random variables are continuous, 
then the copula is uniquely defined by the joint distribution and the marginal distributions by the 
formula 

F(Ff ^(Xl), • • • ,F^'^{Xm)) = C(xi,-- - ,Xm)- 

The implication of the Sklar's Theorem is that, after standardizing the effects of marginals, the 
dependence among components of X is fully described by the copula. Indeed, most conventional 
measures of dependence can be explicitly expressed in terms of the copula. 

"This shows that the information concerning dependence properties is actually hidden in the 
copula. So, they present a natural framework for many investigations "- Nielsen. 
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1.1.3 Copulas and Markov processes 

Copulas have been shown to be a more flexible way to define a Markov process, as in the case when 
one suspects that the marginal distributions of the states are not related to the distribution of the 
initial state. Using copulas will allow changes in single marginal distributions, without having to 
change all other distributions in the chain. 

A stationary Markov chain can be defined by a copula and a one dimensional marginal distri- 
bution. For stationary Markov chains, the transition probabilities are equal to the derivative of the 
copula with respect to the parameter we condition on (for more details, see [H theorem 3.1]). This 
relationship was used in [12] to show that stationary Markov processes defined by the Clayton, 
Gumbel or Student copulas are geometrically ergodic. These notions will be defined later on. 

We will use in this paper the following conventional notation: 

1.1.4 Notations 

1- Ljix) = {g{x) : J^g'^{x)f{x)dx < oo}; 

2- Llix) = {g{x) : f^g^{x)f{x)dx < ooj^g{x)f{x)dx = 0}; 

3- lls-lli = Ii9'^{x)dx; 

4. AAxuX2) = '-^-, 

5. c{x,y)- density of C{x,y). 

1.1.5 Dependence coefficients 

Many dependence coefficients have been studied in the literature, such as a„, /3„, pn, <pn among 
others. In our paper, we will mainly use the last 3 coefficients, that are defined as follows: 
Given cr-fields A, B: 

^(A,B) = Esup \P{B\A) - P{B)\ 



/9(A,B)= sup corr{f,g), 

/eL2(A),geL2(B) 

(^„(A,B)= sup \P{B\A) - P{B)\ 

.,AgA,P(A)>0 



Given the alternative form of the transition probabilities for a Markov chain generated by a copula 
and a marginal distribution with positive density, it was shown in [7] that these coefficients have 
the following simple form when the copula is absolutely continuous with density c: 

Pn= I sup\ {cn{x,y) - l)dy\dx, 

Jo Bcl Jb 

= sup ess sup I / {cn{x,y) - l)dy\, 
Bci X Jb 

Pn = sup{ C C Cn(,x,y)f{x)g{y)dxdy : H^lb = H/lb = 1,E(/) = E{g) = 0}. 
Jo Jo 

Here, c„ is the density of the random variable (Xo,X„). In general the following inequalities hold 
(see [21 theorem 7.4, 7.5] for more.): 

n ) Pn <2^n, Pn<{piT. (1) 
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These coefficients are defined to assess tfie dependence structure of tfie Markov process, and provide 
necessary conditions for CLT and functional CLT necessary for the statistics based on the studied 
model. Some examples can be found in |9], |10) and the references therein. A stochastic process is 
said to be respectively an a-mixing (/3-mixing or p-mixing), if a„ — )• (/3„ — )• or /9„ — )• 0). The 
process is exponentially mixing, if the convergence rate is exponential. A stochastic process is said 
to be geometrically ergodic, if for some a S (0, 1) , and n S N, 

sup \P{Xn G B\Xo = x)- P{Xn G -8)1 < a'^Wix), with E(VF(Xo)) < oo. 

Bex 

By theorem 2.1 of Nemmelin and Tuominen (1982), geometric ergodicity is equivalent to exponential 
P -mixing. So, whenever we encounter the term, we may understand either of the two concepts. 
The main results on this topic are due to Darsow, Nguyen and Olsen (1992), de la Pefia, Ibragimov 
and Sharakhmetov (2006) and Ibragimov (2009), who were among the first to study copula-based 
Markov processes. They characterized copula-based Markov chains. Joe (1997) proposed a class 
of parametric (strictly) stationary Markov models based on parametric copulas and parametric 
invariant distributions. Their setup of the problem was modified by Chen and Fan (2006), who 
studied a class of semi-parametric stationary Markov models based on parametric copulas and 
nonparametric invariant distributions and analyzed the strength of dependence in the Markov 
chain. They showed that the temporal dependence measure is fully determined by the properties 
of the copulas. Studying the strength of the dependence, Beare (2008) provided simple sufficient 
conditions for geometric /3-mixing in terms of copulas without any tail dependence. None of these 
papers is able to verify whether or not a Markov process generated via a tail dependent copula such 
as Clayton or others, is geometric p-mixing. 

The tail dependent copulas are the most popular copulas because they suitably model shocks 
in economics. In an effort to address this issue, Ibragimov and Lentzas (2008) demonstrated via 
simulation that Clayton copula-based first-order strictly stationary Markov models could behave 
as "long memory" in copula levels (meaning that dependence on the initial distribution tends to 
persist). But, Xiaohong Chen, Wei Biao Wu and Yanqing Yi (2009) showed that this simulation is 
misleading, because the copula-based models studied by Ibragimov are actually geometric /3-mixing. 
In conclusion, although a time series plot may look highly persistent and "long memory alike," the 
time series may still be weakly dependent and "short memory " so simulating isn't a good way to 
handle the issue. 

Recently in a discussion with Beare, I learned he had published a result improving the results 
in [12J. Beare provided in a recent paper the proof based on simulation, that the Clayton copula 
provides exponential rho-mixing. The theoretical proof of this fact and many others was given by 
M. Longla and M. Peligrad in [7]. 

The paper is structured as follows: In section two below we provide new results on exponential 
/9-mixing, and exponential /3-mixing for some families of copulas that were not studied before and 
some new families of copulas that generate exponential p-mixing and 0-mixing are given. In section 
3 we provide the proofs of all theorems. 

2 Results on dependence coefficients 

We shall recall that exponential p-mixing is equivalent to pi < 1 for Markov chains. This fact will 
be used in this section to improve some known results. 
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2.1 General condition for exponential /o-mixing 

Theorem 1 A copula-based Markov chain generated by a symmetric copula with square integrable 
density is geometric /3-m.ixing if it is geometric p-mixing. 

This means, that for a Markov chain generated by a copula with square integrable density and an 
increasing marginal distribution, exponential p-mixing implies geometric ergodicity. 

Theorem 2 Let c be the density of the copula C . Assume 

lim yc(x, y) = lim(l — y)c{x, y) = 0; (2) 

\cy{x,y)\dy ^ L2{[^,1]), \\ \cy{x,y)\dy\\l = k2; (3) 

Jo 

c(a;, 1) - c(x,0) G L2([0, 1]), and \\\c{x , 1) - c{x , 0)\ -\- \cy (x, y)\dy\\l = ki < oo. (4) 

Jo 

If ki + k2 < 12, then the Markov process generated by C is an exponential p-mixing (pi < 
\l (k\ + A;2)/12 <\), and therefore geometrically ergodic if the chain is reversible. 

2.1.1 Example of the Farlie-Gumbel-Morgenstern family of copulas 

C{x, y) = xy + 9xy{l -x){l-y), c{x, y) = 1 + 6(1- 2x){l - 2y), Cy{x, y) = -20(1 - 2x), 
€ [—1,1]. For this family, all assumptions of the theorem are met. 

c(x, 1) - c(x,0) = -20(1 - 2x), f \cy{x,y)\dy = 2\9{l-2x)\; 

Jo 

|c(x,l) -c(x,0)| + / \cy{x,y)\dy = 4\e{l-2x)\. 
Jo 

Therefore, ki = 46*2/3 and /c2 = 16073. ki + k2 < 12 if 0^ < 9/5. This is true for all G 
[—1,1]. Therefore, the Farlie-Gumbel-Morgenstern family generates exponential p-mixing and an 
exponential /3-mixing for all its parameters. 

Beare proved that for a copula with density bounded away from zero we have exponential p- 
mixing. These conditions imply actually a stronger mixing condition {(p) as shown in [7J. This 
conditions are relaxed in the following theorem: 

Theorem 3 If the copula of the Markov process is such that there exists nonnegative functions ei, 
82 defined on [0, 1] for which the density of the absolute continuous part of the copula denoted c{x,y) 
satisfies the inequality 

c{x,y) > ei(x) + e2{y) 

with El, £2 G Li[0, 1] such that at least one of the two functions has a non-zero integral, then the 
process is exponential p-mixing, and thus exponential /3-mixing if the density appears to be positive 
on a set of measure 1 or square integrable. 

Remark 4 The presence of the two functions in the theorem is just to emphasize that it doesn't 
matter which of them ends up providing a lower bound on the absolute continuous part of the 
density. The question is irrelevant if the copula is symmetric, but is an issue in general. To apply 
the theorem, one just needs to find any positive function of one variable that can allow a lower 
bound on the density with the stated property. 
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Remark 5 This theorem improves Beare's result /TJ Theorem 4-^]> by extending it to cases when 
the density can actually be equal to zero on a set of non-zero measure, and therefore not be bounded 
away from 0. This is an improvement that avoids the conditions of Beare that allowed to show 
(j)-mixing rate in f^. All examples from his paper are therefore covered by this theorem, and in the 
example below, we can exhibit a copula that provides exponential p-mixing, but was ruled out by his 
assumptions. 



2.2 Examples 

1. The Plankett copula family is defined by 



i + (9-i){u + v)- V(i + (g - i)(n + v)r - Auveje^) 

C{n, V) = (5) 

were 6 is the constant cross product ratio 

^ P{U >u,V > v)P{U <u,V <v) 



P{U >u,V < v)P{U <u,V >v) 

and the density is given by 

i{i + e) ((1 + {9-i){u + v)f - Auve{e - 1)) 

f V^(l + {9- l){u + z;))2 - Auve{e - V 

1 ({9 -l){u + v) + l- 2ue\ ({9 - l)(u + v) + l- Z.VO I 

-■ (6) 

+ {e-i){u + v)y - iuvei9 - 1)) 

For 9 > 1, The denominator of this density is less than or equal to 1 + (0 — l)(n + < 
1 + (0 — 1)(1 + ti). The numerator is a polynomial in with non-zero minimum in v for fixed 
values of u. Therefore, this density is bounded from below by an integrable function of u. 
For 9 < 1, we can divide into two cases: 9 < 1/2 and 1/2 < 9 < 1. In both cases, it is easy 
to bound the denominator from above and the numerator from below to get a lower bound 
on the density of the copula. Thus, by theorem [31 this copula family provides an exponential 
p-mixing for all values of the parameter. 

On this example, one can actually see the power of this theorem. For 9 > 1 the density is not 
bounded from above (it would therefore be (/)-mixing), it is not square integrable (it would 
therefore fall in the range of theorem [1]) and it is not bounded away from (it would therefore 
fall in the range of Beare's theorem). 

Given any functions h : [0, 1] — t- [0, 1], g : [0, 1] — t- [0, 1], the function 

supg_- g{x)h{y) + h{y)\\g\\i + g{x) 

y) — 



sup5 + ||5||l||/^||l 

is the density of a copula that generates an exponential p-mixing Markov chain. 

(The proof uses theorem [3] with ei(x) = — ^^TTrTT%rTi- 1 £2(v) = — ^n^n%%7^)- We have 



pi < 1 - = < 1 usmg Theorem |3| 

2sup5+ 5 1 1 SUP5+ 5 1 1 
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3. Given any functions h : [0, 1] — t- [0, 1], 5 : [0, 1] — t- [0, 1], the function 

sup g sup h - g{x)h{y) + h{y)\\g\\i + g{x)\\h\\i 
sup5fsup/i + ll^llill^lli 

is the density of a copula that also defines an exponential p-mixing Markov chain. We have 



^ ^ _ 1 2||g||i||/i||i ^ sup g sup /t ^ ^ 

~ 2sup5(sup/i + ||5(||i||/i||i supgsup/i + ||5(||i||/i||i 

4. Given any functions h : [0, 1] — t- [0, 1], 5 : [0, 1] — t- [0, 1], the function 

^ ^ sup (7(sup h — inf h) — 5'(x)(sup h — h{y)) + (sup h — h{y))\\g\\i + 5f(x)(sup h - 
' sup5((sup/i — inf h) + ||5(||i(sup/i — ||/i||i) 

is the density of a copula that also defines an exponential /3-mixing Markov chain. We have 

^ sup g(sup h — inf h) ^ 

~ sup(7(sup/i — inf /i) + ||(7||i(sup /i — ||/i||i) 

5. Given any functions h : [0, 1] — t- [0, 1], g : [0, 1] — t- [0, 1], the function 

^ ^ (sup^r — inf (7)(sup /i — inf /i) — (sup (j( — (7(a;))(sup /i — ^ 
' (sup g — inf 5) (sup h — inf h) + (sup 5 — | l^l |i)(sup h — ||/i||i) 

^ (sup/t - /i(j/))(supg - llfflli) + (supff - g{x)){suph- \\h\\i) 
{supg - va.ig){suph- inf /i) + (supg - ||5||i)(sup/i - 
is the density of a copula that also defines an exponential /)-mixing Markov chain. We have 

^ (supg - inf g)(sup h - inf h) ^ ^ 

~ (sup (7 — mi g){suph — inf /i) + (sup (7 — ||(7||i)(sup /i — ||/i||i) 



6. The following density also generates an exponential p- mixing with pi < - — ^ 3° < 1. 

^ + 1 + (1/2 - y)2;i/"-isi5n(l/2 - x^/") 



cix,y) = ^ aG(0,l] (7) 

This density is bounded away from 0, thus it is also providing </>- mixing. 
7. Finally, the bivariate density function 

c{x,y) = l + —x^^°'-\2y-l)sign{l/2-x^/°') ee[-2a,2a], a G (0, 1] 
2a 

generates exponential (/>- mixing for |^| < 2a. 
In the last inequality of the proof of theorem [3l we obtain a guarantee of the following result: 
Lemma 6 Let f{x,y) be a nonnegative function in Li[0, 1] with the following properties: 
1- fif{x,y)dx = 1 a.e; 
^- Iif{x,y)dy = 1 a.e; 

Then, if there exist nonnegative functions ei and 82 in Li[0.1] such that f{x,y) > ei{x) + £2(2/) 
a.e, then 

and 



J ei{x)dx + J £2{x)dx < 2 
f{x,y)g{x)h{y)dxdy < (l- -(^J £i{x)dx + J £2ix)dx^^(^ J g^{x)dx^ j h^{x)dx^^ 
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2.3 Exponential /o-mixing for Archimedean copulas 
2.3.1 Archimedian copulas 

Archimedian copulas have been studied by many researchers and are very popular because of the 
associativity property that they share. A result by Beare (2010) is the first about general necessary 
conditions for exponential /?- mixing. He proved that under some mild conditions, some strict 
Archimedian copulas generate exponential /o-mixing. 

Archimedian copulas are defined in [8] as follows: Given a function ip : [0, 1] — t- [0, oo) such that: 
is a decreasing concave up function (positive second derivative whenever it exists), (/?(!) = 0; if 
y?(0) = oo, then (p is called a strict generator or generator of a strict Archimedean copula given by 
the formula: 

C{u,v) = Lp ((/?(«) + (/7(u)); c{u,v) = — ^3 



If 9^(0) < oo, then ip is called non-strict generator or generator of a non-strict Archimedean 
copula defined by 

C(u, v) = 99~^(min {Lp{u) + (p(v) , (piQ))) . 

A non strict generator can always be standardized to have (/^(O) = 1. It can be proved that all 
generators of the same Archimedean copula are scalar multiples of the standard generator (I will 
provide a proof of this in the appendix). So, without loss of generality I can state all my results 
with the standard generator, and convert later on to any initial generator. 

Theorem 7 Let ip be a non-strict standard generator of an Archimedean copula not equal to the Ho- 
effding lower bound. Assume ip is twice differentiable and its second derivative is decreasing. Then, 
the copula generates an exponential p-mixing Markov process if one of the following inequalities is 
satisfied: 

, (l-x)( . , ^^f. ..X dx<l, or [ h\x){l - x)dx < {ip' for ^'{1)^0 
\{ipoip ^{x)y/ Jo 

where h{x) = max ip oip~^ {x -\- y); 
0<y<l—x 

h{x) = ip (0) if ip is decreasing and h{x) = ip OLp'~^{x) if ip is increasing. 
2.3.2 Example 

The Archimedean copula with generator ip{u) = — hi{6u + 1 — 9). The standard generator is 

, , IniOu + l-e) ^ , , 
^(-)= ln(l-.) ' 

y^) = a ' (a;) = — — 7 T3fl7> 'Pop> {x) 



9^(1) 



ln(l-0)(x+i/)' - - ^ ' ln(l-0)(l 
ip op \x + y) = — ^-j— 77 — — , h{x) = — )— — 



\n{l-ey ^ ^ ' -ln(l-0) ' ^' -ln(l 
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0^ '\{ip'oip-^{x)yJ 4(1-0)4^16 16(1-0)4^ 

So, we obtain exponential p-mixing for 9 £ {0,9o), where — -348 is the unique value of theta for 
which the inequality becomes an equality. 

2.3.3 Remark 

This example is taken from [SJ §6], from the list of Archimedean copulas that have been used 
in applications. On this example we can see that the theorem doesn't handle the case of the 
independence copula, corresponding to 9 = 1. This is due to the fact that in this limiting case, 
the copula becomes strict. We might be tempted to include the case = by only looking at the 
final inequality. A closer look at the original generator shows that for this value, the function is 
constant! On the other hand, the limiting copula is the Hoeffding lower bound, which is ruled out 
by the assumptions of the theorem. 



2.3.4 Example 



1 + (t^ — Ijx 



, -9 , _i (i + (g-i)x)2 „ 29{9-l) 



(l + (0-l)x)2' ^ ^ ^ ^ -e ' ^ V . (1 + 1)^)3- 

Again, we can see that the case = 1 is ruled out, as (f" (x) = for this value, ip is decreasing, so, 

f{9) is an increasing function on [l,oo) with /(I) = 0, /(oo) = oo. Thus, there exists a unique 
9q ~ 1.388 for which f{9o) = 1. Therefore, the copula generates exponential p-mixing Markov 
chain for 9 € (1, ^o)- 

2.4 Other results 

In the paper [7], a result reads as follows: 

Lemma 8 Any convex combination of geometrically ergodic reversible Markov chains is geometri- 
cally ergodic. 

Two simple applications of this lemma lead to the following corollaries: 



Corollary 9 Given two Markov processes with transition kernels P and Q, If P is geometrically 
ergodic and Q is irreducible, reversible and has a spectral gap as operator in L2(x, /i), then aP + 
(1 — a)Q is geometrically ergodic and has a spectral gap in L'^{x,h)- 

Corollary 10 The Markov chain generated by the convex combination of symmetric copulas, whose 
Markov processes are geometrically ergodic is also geometrically ergodic and exponential p-mixing. 
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2.4.1 Exponential /3- mixing for the Prechet and Mardia families of copulas 

C(x, y) = ^!il±^M(x, y) + {l- e^)P{x, y) + ^'^^ ~ ^ V (x, y), G [-1, 1] (8) 

C{x,y) = aM{x,y) + {l-a- l5)P{x,y) + l5W{x,y) (0 < q + /3 < 1) (9) 

([8]) defines the Mardia family of copulas and ([9]) defines de Frechet family of copulas. Notice that 
a Mardia copula is a Frechet copula with a + /3 = 9^. The copulas W and M are the Hoeffding 
lower and upper bounds respectively and P is the independence copula. For any copula from this 
family, there exists a singular density 

C, v) = al{u - V < 0) + /3I(1 -u-v<Q) + {l-a- /3)v; 

c{u, v) = a6u{v) + f36i-u{v) + 1 - a - /3. 

If the Markov chain is generated by a copula from the Frechet (Mardia) family with (q + /3 7^ 1) 
{9 7^ 1), then the n steps joint distribution is given by 

C"(x, y) = C„„,;3„(x, y); a„ = ^[{a + /3)" + (a - /3)"], /3„ = ^[(« + - (a - /3)"] (10) 

and the process is exponentially /3-mixing. For the excluded parameters there is no mixing. 

Remark 11 1. First of all, notice that theorem\^ could he applied to both families to show that we 
have exponential p-mixing for cases when we have geometric ergodicity above; but the theorem 
wouldn't answer the question for the rest of the values of the parameters. 

2. On this example as in the previous examples, we find the copula W as a limiting copula in the 
family, and there is exponential p-mixing for admissible parameters in the neighborhood of the 
parameter that corresponds to it. This opens some question for further research: What is the 
reason of such behavior? Is it always the case? 

2.4.2 Practical Example for the Metropolis Hastings Algorithm 

The popular kernel of the independent Metropolis Hastings Algorithm, that is used to generate 
Markov chains with a given probability of staying at the same state x equal to p{x), where x G 
[— 1, 1], is defined by: 

Q{x, A) = p{x)6a{x) + {1 — p{x))p{A), where is a measure on [—1,1]. 
If ^ = J^^ i-p(l) < then the invariant distribution of this process is vr defined by 

p{dx) 



7r{dx) 



9(1 -p{x))' 



Therefore, if we allow the acceptance probability to depend on a parameter /3, and require the 
marginal distribution to be uniform on [—1, 1] and p, absolutely continuous with respect to the 
Lebegues measure with density h{x,(3), then we obtain the following: 

Qix, A) = p{x, /3)6a{x) + (1 - pix, /3))piA); 
Tr{dx) = ^dx h{x, (3) = k{l - p{x, /?)), 9 = 2k. 
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The corresponding copula representation is therefore defined by 

c,i(^, ^) = Pix, mx <y)+ kil - pix, 15)) j\i - Pit, mt. 

So, the derivative of the copula is 

/2d— 1 r-x 
{l-p{t,p))dt, f{x) = J p{x,/3)dt. 

Therefore, C{u,v) = ^ f (^miii{2u - l,2v - 1)^ + k{2u - f{2u - l)){2v - f{2v - 1)) , (11) 
For this process, if we take p{x,f3) = /3|x| with /3 < 1, then due to (Sj Lemma 2], 

oon/2+l 

f3n < 3E^(p"/2(X, /?)) = <kp^, p=y^; 



On the other hand, Q(x, ■)g{x) = p{x)g{x) = I3\x\g{x) for g G L2fl[—1, 1) implies 

||g||2</3(' sup / \x\^g^{x)dx\^''^ < I3{ sup / ^^(x)^^) = /3 < 1. 

Therefore, this process is both exponential p- mixing and exponential /3- mixing. For (3 = 1, this 
process is a /3-mixing with 1/n decay rate, but fails to be p- mixing. In general, any function of the 
form (|lip with an increasing differentiable function /, such that /(— 1)=0, /(1)=2 — 1/A; will 
define a one parameter copula family with 1 > k > 1/2. This family will provide an exponential 
/9-mixing for 1/2 < A; < 1. 

Remark 12 The Farlie-Gumhel Morgenstern's example provides actually exponential (p-mixing 
when parameters are in the interior of the domain, and this theorem allows to prove the mixing 
rate for the boundary values. Theorem\^ could also he applied to this problem to equivalently obtain 
the same result for 9 G (—1, 1). Theorem\^ would not allow to investigate the boundary cases. This 
shows that these theorems are complementary, and do not follow from each other. 

3 Appendix 1: Mathematical proofs 
3.1 Proof of theorem [1] 

The square integrable density defines a Hilbert-Schmidt operator, and therefore a compact operator. 
So, there exist a basis of eigen- functions in ^2(0, 1). Reversibility implies a spectral representation 
of the operator in the form 

00 

c{u,v) = \i4>i{u)4>i{v), 

where (j)i{u) are the eigen- functions corresponding to the eigen- values Aj, and form an orthonormal 
basis of L2. All Aj exist and are positive real numbers in the decreasing order. 1 is eigen- value with 
eigen- function 1. Therefore, A^ are eigen- values of the operator c^ with the same corresponding 
eigen- functions. So, 
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Ck{u,v) = 1 + ^ Af(/>i(u)^i(f), 

i=l 

Pk = sup{| /' Cfc(n, v)f{u)g{v)dudv\ : E(/) = W.{g) = 0, E(/2) = ^{g^) = 1} 

JO 

Ck{u,v)f{u)g{v)dudv\ = | / (l)i{u)f{u)du / 0i(n)5((t>)(it'| < 



Then, for k > 2, we have 



oo oo 



Here c = (X^^^i A?)/Af ; and the series converges because we have a Hilbert-Schmidt operator. 
Therefore, if Ai < 1, then converges to exponentiahy. On the other hand, if Ai = 1, then 



Ck{u,v) = I + X'lMu)Mv) + ^^iMu)(l)i{vy, 

i=2 

Because the basis is orthonormal, we have < 1, (j)i{u) >= (f)i{u)du = for i 7^ 1; and 

< Ck{u,v)(j)i{u),Mv) >= Ai / (l)l{u)du < Mv),h{v) >= Xi- 

Jo 

Thus, 1 > Pk ^ Ap Therefore, if Ai = 1, then p^ = 1 for ah k. So, we have exponential p-mixing if 
and only if pi < 1. 

Now, recall /3fc = / sup | / {ck{u,v) — l)du\dv. So, 

JO B J B 



I sup| / (y \\(t>i{u)(l)i{v))du\dv <^ X\ / \(l)i\{v)dvsnY> / \^i\{u)du < 

B Jb ~[ ~{ JO B Jb 

<f;Af fmv)dv,nvp"HB)( [ 

~l Jo B ^Je 



^ 1/2 

0i {u)du 



B 

00 „i 00 „i X 1/2 °° 

i=l JO 



Therefore, < cX^. So, /3fc converges exponentially to when pi < 1. 

3.2 Proof of theorem [3] 

Let f, g be two functions with \\f\\2 = \\g\\2 = 1,E(/(X)) = E{g{X)) = 0. We have 2f{x)g{y) 
fix) + g\y) - (fix) - g{y)f. Therefore, 
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2/ f{x)g{y)C{dx,dy)= [ f{xfC{dx,dy)+ f g{yfC{dx,dy)-f {f{x)-g{y)fC{dx,dy). 
J12 Jp Jp Jp 

But, [ f{xfC{dx,dy)= [ f{xf [ C{dx,dy)= [ f{xfdy = l= [ g{yfC{dx,dy), 
Jp Ji Ji Jp Jp 

(fix) - 9{y)fC{dx, dy) > [ (fix) - g{y)f{ei[x) + e2{y))dxdy = 
P Jp 

U\x) + g\y) - 2f{x)g{y)){ei{x) + e2{y))dxdy = h + h 

P 

h= f{x)e2{y)dxdy + j g'^ {y)£2{y)dxdy - 2 j f{x)g{y)£2{y)dxdy 
Jp Jp Jp 

Ia= I f{x)ei{x)dx dy + ei{x)dx / g^{y)dy-2 / f{x)ei{x)dx / g{y)dy = 
Ji Ji Ji Ji Ji Ji 

f'^{x)£i{x)dx + J ei{x)dx > J £i{x)dx 
Similarly, lb > Jj e2{y)dy. Thus, 



{f{x)-g{y)) C{dx,dy) <- ei{x)dx - / e2{y)dy 
p Ji Ji 

This implies 2 J f(x)g{y)C{dx,dy) <2 — {J ei{x)dx + J e2{y)dy) 

Or simply 

corr{f,g) < 1 - ]^{j^ei{x)dx + j^e2{y)dy) 
Because this holds for all such f and g, it holds for f and —g. Thus 

\corr{f,g)\ < 1 - ^{j^ei{x)dx + j ^e2{y)dy) 

Provided that one of the two integrals of the right hand side is non-zero (say e), we obtain 

\corr{f,g)\ < 1 - < 1. 

The last equation shows that pi < 1, as e is independent of f and g. Combining this with the 
previous theorem, we obtain exponential p-mixing. 

3.3 Proof of theorem [7] 

Let X = <f{U), Y = ifiV). Using conditional probabilities or the transformation formula, we find 
the joint density of (X, Y): 

Hx, y) = — '-^"^^Ig ljx + y < 1}. 
[ip oip-^ix ^y) ^ 
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We have 



Pi{U,V) = sup corr{f{U),g{V)) = swpcorr{fo(p ^ {(p{U)) , go(p ^{(p{V))) = 

f,9 f,9 



= snp corr{f{X),g{Y)) = pi{X,Y). 

1,9 

The second equahty seems obvious, but needs clarifications. In fact the function ip~^ makes sense 
only for < x, y < 1. This condition is satisfied, because we use the density on the < a; + y < 1. 
Otherwise, wc would not be able to have Lp^^OLp{X) = X. The third equality holds for / = foLp, 
g = go(p. But it is not clear that any square integrable function of X, Y can be written in this 
form. This equality holds, because we can write the same chain of equalities starting from X, Y, and 
obtain two inequalities leading to the given conclusion. Using this fact, we can therefore compute the 
maximal correlation coefficient oi X,Y , and bound it to get a condition for exponential p- mixing of 
the Markov chain generated by the initial copula. Let / and g be such that K{f{X)) = E{g{X)) = 0, 
Var{f{X)) = Var{g{Y)) = 1. Given the formula of the density, the correlation coefficient between 
f{X) and g{Y) can be computed by; 

Therefore, \corr{f{X), giY))\ < C \f{x)\ f'^ f "^"'^^ + ^^'^^^^^1 dydx 

•^0 -^0 \^-ip'oip-^{x + yyj 

ip"^ is decreasing as (p is decreasing, and cp is increasing. Thus, 

= ip-Hl) < cp-\x + y)< p-Hx), p-\y) < p-\Q) = 1; 

(p (Q) = (p oip (1) < p op {x + y) < (p op) [x),p op> [y) < p op? [0) = p (1) < 0; 
11 1 1 

(-<^'(0)) (^-^'op~^ix + y)) (^-p'op-^x)) [-p'op-^y)) {-p{l)) 

Knowing that U,V are uniform on [0,1], we obtain that X, Y share the same distribution with 
CDF and pdf 

P{X <x) = P{Y<x) = l- p?~\x), fxix) = fvix) = , ^ 

—p op \X) 

Therefore, using this density and the bound 

h{x) = max ip op~^{x + y), 

0<y<l—x 

we obtain the two inequalities below, that we investigate using twice the Holder inequality and the 
value 1 for the variance of the random variables X and Y. 

I ifnn .T.^M^ r h{x)\f{x)\ \giy)\dy 
\corr{f{U),g{V))\ < / ^ / -^dx (12) 



[-p'op-^x)) ^0 (-^'o^-Hz/)) 



[-pop i(x)J \^-(pop} i(y)J 
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The inequalities [T2l and [T3l above lead to 

.r.rr^ .X M /" ^ (a^) I / (x) I ( 1 - x) .^l. y ) ^2 , \ 1/2 



if' oip i(x) 

Therefore, taking the supremum over all possible functions / and g, we obtain in both cases: 
So, pi < 1 if 

/V , , ^^f, .., )'(! -^)^^<1 or C h\x){l-x)dx<{^'{l)f for (^'(1) / 0. 

Now, if Lp" is decreasing, then = Lp" OLp~^{l) = if" {^), and if 93" is increasing, then h{x) = 

(f o(p~^{x). This takes into account the fact that {p~^ is decreasing, and ends the proof of the 
theorem. 

In general, the function h{x) is easy to compute because it needs only the behavior of the second 
derivative of the generator on the interval of definition. The use of the standard generation helps 
to avoid question about the ranges and domains of function that are used throughout the proof. 



3.4 Proof of corollaries M, and [TOl 

For corollarylH the existence of a spectral gap in L'^{x, ^) is equivalent to L'^{x, fj)- geometric ergod- 
icity for reversible Markov chains. So, the Markov chain generated by P is L^(x, ;^)- geometrically 
ergodic. Therefore, a straight application of lemma [8] leads to the conclusion. 

For corollary [TOl according to |1H theorem 2.1], both Markov processes have a spectral gap 
in L^(x,7r). The lemma states, that therefore, their convex combination has a spectral gap in 
L^(x,'?r). Applying again fill theorem 2.1], the spectral gap is equivalent to geometric ergodicity; 
so, the Markov chain generated by the convex combination is geometrically ergodic. 



3.5 Proof of result [2X11 

Proof The proof of the formula of the n steps transition copula is based on the multiplicative 
property of the copula family and the recurrence relationship that can be easily established between 
n and n + 1 steps copulas. Moreover, any set Su^ = [0,tto], 1*0 < 1/2, is a small set for a + /3 7^ 1, 
as for any ni,tt2 < uq, Su^ C Suq and C Suq, 

P{Xn e Su„Xo G Su,) = C''{ui,U2) > (1 - an - MuiU2 = (1 - o - e Su,)P{Xo e Su,). 

Now, 

\P{Xn G Suo\Xq G Suo) — P{Xq e Suq)\ = I — ^ °' ^1 

Xq 

_ anM{uo,uo) - {an + /3n)uQ + /3nW{uo, up) 

Uq 

= \an- (an + l3n)uo\; take = 1/2; 
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\P{Xn G Suo\Xo G Su,) - P{Xo e Suo)\ = " ^ ^" = -/^r- 

So, we conclude that this goes to zero exponentially fast for —1 < a — /3 < 1); for a = 1 (it 
corresponds to the copula M) or /3 = 1 (it corresponds to the copula W), the Markov process is 
not a /3-mixing. For any convex combination of the two copulas ( corresponding to a + ;5 = 1), we 
can't use this method to provide the answer, as the coefficient in the inequality defining the small 
set is zero, and thus, the sets we are using are not small sets. On the other hand, the problem can 
still be handled. The corresponding transition operator acts as follows: 

Qf{u) = afiu) + (1 - a)/(l - u). 

Therefor, if we can find a function / defined on /, such that E(/) = and /(I — u) = f{u) for all 
u, then will have Qf{u) = f{u), leading to ||Q/||2 = H/lb and ||Q|| = 1. There is a huge class 
of such functions. For instance, fn{x) = cos{2mTx) satisfies these assumptions. Thus, \\Q\\ = 1 
- there is no spectral gap. Therefore, we can conclude that we have exponential convergence for 
all parameters except when a + /3 = 1. By [11^ theorem 2.1] , the Markov process generated by a 
Frechet copula (or Mardia copula) is therefore an exponential /3-mixing and exponential /3-mixing, 
except when they represent a convex combination of M and W. 



3.6 Proof of theorem [2] 

We consider the functional operator T : L2([0, 1]) — t- L2{[0, 1]) defined by: 

T{f){x) = [ f{y)c{x,y)dy, c{x , y) = C ^uix , y) . 
Jo 

Define the operator Tif{x) = T[f(x) — E{f(x))). The following holds: 

||T||= sup mZMk= (14) 
" " /eLO(fo,i]) Il/(^)ll2 /eL.([o,i])ll/(x)-i?(/(x))||2- ^ ' 

Let H = L2([0, 1]) be a Hilbert space. Let T be a bounded operator defined on Hq = L2{[0, 1]), 
and {ei{x),i G N} be an orthonormal basis of H such that E{ei{x)) = for all j ^ 1. Then 



l^lp < ^ ||T( 



i>l 



Proof: 

E{f{x)) = and E{ei{x)) = for i > 1 — ^ oi = 0. 
/(x) = ^a.e,(x)^||/||2 = (5]a?)V2; 

i>l 

i>l i>l i>l 

The last inequality above uses the Holder or Cauchy inequality. This leads to ||r|| < 
(Ei>i ll^ei||i)i/2^ We win bound this norm for the operator that we have defined to prove 
the theorem, and find a sufficient condition for the norm of the initial operator to be less than 1. 
One of the most convenient basis is the following: 



l,ei = V2sin{2i'irx);ei = V2cos{2mx);i = 1,2,- 
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For this basis, we have E{ei{x)) = except for ej = 1 and it's an orthonormal basis of our Hilbert 
space. It remains the evaluate or bound the quantities UTejlH. 
Case 1. Ci = ^/2cos{2i'Kx). 

/■i sin(2iTTv) 1 

{l/V2)Tei{x) = / c{x,y)cos{2Triy)dy = c{x,y) — |^=o - — / Cy{x,y)sin{2my)dy 

Jo 2m " 2nr Jq 

Therefore, for the absolute value of (l/\/2)Tej(x), using \sin{27riy)\ < 1 and we obtain: 

1 

\(l/V2)Teiix)\<— \cy{x,y)\dy 



1 



iTeill^ < — — 2II / \cy{x,y)\dy\\l] 

,2 ^ ^1 







So llreJIn < 



2(i7r)2 

Case 2. Cj = V2sin{2iTrx) . Same as above with the only difference that for this case, the first 
part is not zero, but —{■^)[c{x, 1) — c(x,0)]. So, 

1 

\{l/V2)Tei{x)\ < _[|c(x,l) -c(x,0)| + / \cy{x,y)\dy]. 

1 

ll^eill^ < II — [|c(x,l) -c(x,0)| + \cy{x,y)\dy]\\l 



\Te^\\l < 



2(i7r)2 

Taking into account both cases, using 

Vi - — 

STWT^t M|2 _ fcl + ^2 1 _ fci + ^2 

i>l ^ ' i>0 

This norm is less than 1 for /ci+A;2 < 12, and therefore the generated Markov chain is an exponential 
p-mixing. 
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